AITopics | Carson City

Collaborating Authors

Carson City

Chain-of-Thought Reasoning In The Wild Is Not Always Faithful

Arcuschin, Iván, Janiak, Jett, Krzyzanowski, Robert, Rajamanoharan, Senthooran, Nanda, Neel, Conmy, Arthur

arXiv.org Artificial IntelligenceMar-19-2025

Chain-of-Thought (CoT) reasoning has significantly advanced state-of-the-art AI capabilities. However, recent studies have shown that CoT reasoning is not always faithful, i.e. CoT reasoning does not always reflect how models arrive at conclusions. So far, most of these studies have focused on unfaithfulness in unnatural contexts where an explicit bias has been introduced. In contrast, we show that unfaithful CoT can occur on realistic prompts with no artificial bias. Our results reveal non-negligible rates of several forms of unfaithful reasoning in frontier models: Sonnet 3.7 (16.3%), DeepSeek R1 (5.3%) and ChatGPT-4o (7.0%) all answer a notable proportion of question pairs unfaithfully. Specifically, we find that models rationalize their implicit biases in answers to binary questions ("implicit post-hoc rationalization"). For example, when separately presented with the questions "Is X bigger than Y?" and "Is Y bigger than X?", models sometimes produce superficially coherent arguments to justify answering Yes to both questions or No to both questions, despite such responses being logically contradictory. We also investigate restoration errors (Dziri et al., 2023), where models make and then silently correct errors in their reasoning, and unfaithful shortcuts, where models use clearly illogical reasoning to simplify solving problems in Putnam questions (a hard benchmark). Our findings raise challenges for AI safety work that relies on monitoring CoT to detect undesired behavior.

large language model, machine learning, natural language, (21 more...)

arXiv.org Artificial Intelligence

2503.08679

Country:

Europe (1.00)
Asia (1.00)
North America > United States > New York (0.28)
(3 more...)

Genre: Research Report > New Finding (1.00)

Industry:

Leisure & Entertainment (0.68)
Media > Film (0.46)
Education (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

Land deal leads to Carson City company that's still computing

#artificialintelligenceFeb-18-2018, 00:50:13 GMT

According to Al Fiegehen, chief executive officer of Cubix Corporation in Carson City, the computer industry is about to explode at the same rate it did in the 1980s when computer technology forever changed the world.

artificial intelligence, carson city, machine learning, (14 more...)

#artificialintelligence

Country: North America > United States > Nevada > Carson City (0.78)

Industry:

Government (0.51)
Information Technology (0.32)
Transportation (0.31)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.30)

Add feedback

I-athlon: Towards A Multidimensional Turing Test

Adams, Sam S. (IBM T. J. Watson Research Center) | Banavar, Guruduth (IBM T. J. Watson Research Center) | Campbell, Murray (IBM T. J. Watson Research Center)

AI MagazineApr-13-2016

While the Turing test is a well-known method for evaluating machine intelligence, it has a number of drawbacks that make it problematic as a rigorous and practical test for assessing progress in general-purpose AI. For example, the Turing test is deception based, subjectively evaluated, and narrowly focused on language use. We suggest that a test would benefit from including the following requirements: focus on rational behavior, test several dimensions of intelligence, automate as much as possible, score as objectively as possible, and allow incremental progress to be measured. In this article we propose a methodology for designing a test that consists of a series of events, analogous to the Olympic Decathlon, which complies with these requirements. The approach, which we call the I-athlon, is intended to ultimately enable the community to evaluate progress towards machine intelligence in a practical and repeatable way.

chess, creativity & intelligence, turing test, (21 more...)

AI Magazine

Country: North America > United States > Nevada > Carson City (0.14)

Industry:

Leisure & Entertainment > Sports (0.49)
Leisure & Entertainment > Games > Chess (0.47)

Technology:

Information Technology > Artificial Intelligence > Vision (1.00)
Information Technology > Artificial Intelligence > Natural Language (0.94)
Information Technology > Artificial Intelligence > Robots (0.93)
(3 more...)

Add feedback

How to Write Science Questions that Are Easy for People and Hard for Computers

Davis, Ernest (New York University)

AI MagazineApr-13-2016

As a challenge problem for AI systems, I propose the use of hand-constructed multiple-choice tests, with problems that are easy for people but hard for computers. Specifically, I discuss techniques for constructing such problems at the level of a fourth-grade child and at the level of a high-school student. For the fourth grade level questions, I argue that questions that require the understanding of time, impossible or pointless scenarios, of causality, of the human body, or of sets of objects, and questions that require combining facts or require simple inductive arguments of indeterminate length can be chosen to be easy for people, and are likely to be hard for AI programs, in the current state of the art. For the high-school level, I argue that questions that relate the formal science to the realia of laboratory experiments or of real-world observations are likely to be easy for people and hard for AI programs. I argue that these are more useful benchmarks than existing standardized tests such as the SATs or Regents tests. Since the questions in standardized tests are designed to be hard for people, they often leave many aspects of what is hard for computers but easy for people untested

beaker, commonsense reasoning, neural network, (20 more...)

AI Magazine

Country:

North America > United States > New York (0.29)
North America > United States > Nevada > Carson City (0.14)
North America > United States > California > San Mateo County (0.14)

Industry:

Education > Educational Setting > K-12 Education > Secondary School (0.74)
Education > Educational Setting > K-12 Education > Primary School (0.54)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Commonsense Reasoning (0.69)
Information Technology > Artificial Intelligence > Cognitive Science > Problem Solving (0.68)
Information Technology > Artificial Intelligence > Challenges (0.66)

Add feedback

Seven Challenges in Parallel SAT Solving

Hamadi, Youssef (Microsoft Research, 7 JJ Thomson Avenue, Cambridge CB3 0FB, United Kingdom) | Wintersteiger, Christoph (Microsoft Research, 7 JJ Thomson Avenue, Cambridge CB3 0FB, United Kingdom)

AI MagazineJul-5-2013

This paper provides a broad overview of the situation in Parallel SAT Solving. A set of challenges to researchers is presented which, we believe, must be met to ensure the practical applicability of Parallel SAT Solvers in the future. All these challenges are described informally, but put into perspective with related research results, and a (subjective) grading of difficulty for each of them is provided.

constraint-based reasoning, solver, survey article, (21 more...)

AI Magazine

Country:

North America > United States > New York (0.14)
North America > United States > Nevada > Carson City (0.14)
North America > United States > California (0.14)
Europe > Switzerland > Zürich > Zürich (0.14)

Industry: Information Technology (0.47)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Constraint-Based Reasoning (1.00)
Information Technology > Artificial Intelligence > Representation & Reasoning > Search (0.95)

Add feedback

Artificial Intelligence: Some Legal Approaches and Implications

Willick, Marshall S.

AI MagazineJun-15-1983

Various groups of ascertainable individuals have been granted the status of "persons" under American law, while that status has been denied to other groups. This article examines various analogies that might be drawn by courts in deciding whether to extend "person" status to intelligent machines, and the limitations that might be placed upon such recognition. As an alternative analysis, this article questions the legal status of various human/machine interfaces, and notes the difficulty in establishing an absolute point beyond which legal recognition will not extend.

computer, government & the courts, neural network, (18 more...)

AI Magazine

Country: North America > United States > Nevada > Carson City (0.14)

Genre: Overview (0.48)

Industry:

Law > Civil Rights & Constitutional Law (1.00)
Health & Medicine > Therapeutic Area (1.00)
Government > Regional Government > North America Government > United States Government (1.00)
Law > Government & the Courts (0.93)

Technology:

Information Technology > Artificial Intelligence > Cognitive Science (0.46)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.46)

Add feedback